Clustering Techniques for Databases of Cad Models

ثبت نشده
چکیده

Computer-Aided Design data and Solid Models are of immense importance across the full spectrum of engineering disciplines. Current industrial estimates point to the existence of over 30 billion CAD models in the world, with increasing numbers of them including 3D solid models. While the database and data mining communities have made great strides in management of image, audio and video media, little work exists on how to perform engineering, content-based mining of CAD data and solid models. CAD data, and the engineering consumers of CAD data, have greatly different needs and requirements from traditional database multimedia: large, individual database elements; multidisciplinary components (mechanical, electrical, etc.) with unique access critera; and the lack of an accepted set of readily identifiable features that can be used for model indexing and clustering. This paper presents our approach to perform similarity-based clustering on large databases of solid models of mechanical CAD designs. A publicly available source of solid models is the National Design Repository (http://www.designrepository.org), which contains over 55,000 publicly available solid models from many different engineering disciplines. It is this repository which we test our techniques against. We create a mapping of a solid model's boundary representation and engineering attributes into Model Signature Graphs (MSG). From the MSG's, we create an Invariant Topology Vector (ITV) which captures shape and engineering properties of the CAD models. We use a-clustering algorithm to group the models and provide some of our empirical results to illustrate the approach. We believe that our methodology can form the basis for adapting data mining technquies to large engineering databases and building tools for handling 3D solid models as database media.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Combined Descriptive and Predictive Methods of Data Mining for Coronary Artery Disease Prediction: a Case Study Approach

Heart disease is one of the major causes of morbidity in the world. Currently, large proportions of healthcare data are not processed properly, thus, failing to be effectively used for decision making purposes. The risk of heart disease may be predicted via investigation of heart disease risk factors coupled with data mining knowledge. This paper presents a model developed using combined descri...

متن کامل

Shape-based Clustering Of Enterprise CAD Databases

Cluster analysis is a primary data mining method for knowledge discovery in spatial databases, where, the goal is to find ‘natural’ groups in a dataset based on a similarity or dissimilarity function for pairs of objects. With the number and size of spatial databases in various domains growing rapidly over the last couple of decades, methods for automated knowledge discovery in these datasets i...

متن کامل

Experimental Evaluation of Algorithmic Effort Estimation Models using Projects Clustering

One of the most important aspects of software project management is the estimation of cost and time required for running information system. Therefore, software managers try to carry estimation based on behavior, properties, and project restrictions. Software cost estimation refers to the process of development requirement prediction of software system. Various kinds of effort estimation patter...

متن کامل

Extraction and 3D Segmentation of Tumors-Based Unsupervised Clustering Techniques in Medical Images

Introduction The diagnosis and separation of cancerous tumors in medical images require accuracy, experience, and time, and it has always posed itself as a major challenge to the radiologists and physicians. Materials and Methods We Received 290 medical images composed of 120 mammographic images, LJPEG format, scanned in gray-scale with 50 microns size, 110 MRI images including of T1-Wighted, T...

متن کامل

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001